Pardubice Region
Approximating Human Preferences Using a Multi-Judge Learned System
Sprejer, Eitán, Avalos, Fernando, Bernardi, Augusto, Faustino, Jose Pedro Brito de Azevedo, Haimes, Jacob, Oozeer, Narmeen Fatimah
Aligning LLM-based judges with human preferences is a significant challenge, as they are difficult to calibrate and often suffer from rubric sensitivity, bias, and instability. Overcoming this challenge advances key applications, such as creating reliable reward models for Reinforcement Learning from Human Feedback (RLHF) and building effective routing systems that select the best-suited model for a given user query. In this work, we propose a framework for modeling diverse, persona-based preferences by learning to aggregate outputs from multiple rubric-conditioned judges. We investigate the performance of this approach against naive baselines and assess its robustness through case studies on both human and LLM-judges biases. Our primary contributions include a persona-based method for synthesizing preference labels at scale and two distinct implementations of our aggregator: Generalized Additive Model (GAM) and a Multi-Layer Perceptron (MLP).
- North America > United States > California > San Francisco County > San Francisco (0.14)
- South America > Brazil > São Paulo (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology > Security & Privacy (1.00)
- Law (0.93)
A Cost-Effective Framework for Predicting Parking Availability Using Geospatial Data and Machine Learning
Bagosher, Madyan, Mustafa, Tala, Alsmirat, Mohammad, Al-Ali, Amal, Jawarneh, Isam Mashhour Al
As urban populations continue to grow, cities face numerous challenges in managing parking and determining occupancy. This issue is particularly pronounced in university campuses, where students need to find vacant parking spots quickly and conveniently during class timings. The limited availability of parking spaces on campuses underscores the necessity of implementing efficient systems to allocate vacant parking spots effectively. We propose a smart framework that integrates multiple data sources, including street maps, mobility, and meteorological data, through a spatial join operation to capture parking behavior and vehicle movement patterns over the span of 3 consecutive days with an hourly duration between 7AM till 3PM. The system will not require any sensing tools to be installed in the street or in the parking area to provide its services since all the data needed will be collected using location services. The framework will use the expected parking entrance and time to specify a suitable parking area. Several forecasting models, namely, Linear Regression, Support Vector Regression (SVR), Random Forest Regression (RFR), and Long Short-Term Memory (LSTM), are evaluated. Hyperparameter tuning was employed using grid search, and model performance is assessed using Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and Coefficient of Determination (R2). Random Forest Regression achieved the lowest RMSE of 0.142 and highest R2 of 0.582. However, given the time-series nature of the task, an LSTM model may perform better with additional data and longer timesteps.
- North America > United States > Texas (0.14)
- Asia > Middle East > UAE > Sharjah Emirate > Sharjah (0.05)
- Europe > Czechia > Pardubice Region > Pardubice (0.05)
- (3 more...)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Ground > Road (1.00)
Political Leaning and Politicalness Classification of Texts
This paper addresses the challenge of automatically classifying text according to political leaning and politicalness using transformer models. We compose a comprehensive overview of existing datasets and models for these tasks, finding that current approaches create siloed solutions that perform poorly on out-of-distribution texts. To address this limitation, we compile a diverse dataset by combining 12 datasets for political leaning classification and creating a new dataset for politicalness by extending 18 existing datasets with the appropriate label. Through extensive benchmarking with leave-one-in and leave-one-out methodologies, we evaluate the performance of existing models and train new ones with enhanced generalization capabilities.
- North America > United States > Maryland > Baltimore (0.14)
- Europe > Spain (0.04)
- North America > United States > Washington > King County > Seattle (0.04)
- (12 more...)
- Government (1.00)
- Media > News (0.93)
Unveiling Dual Quality in Product Reviews: An NLP-Based Approach
Poświata, Rafał, Mirończuk, Marcin Michał, Dadas, Sławomir, Grębowiec, Małgorzata, Perełkiewicz, Michał
Consumers often face inconsistent product quality, particularly when identical products vary between markets, a situation known as the dual quality problem. To identify and address this issue, automated techniques are needed. This paper explores how natural language processing (NLP) can aid in detecting such discrepancies and presents the full process of developing a solution. First, we describe in detail the creation of a new Polish-language dataset with 1,957 reviews, 540 highlighting dual quality issues. We then discuss experiments with various approaches like SetFit with sentence-transformers, transformer-based encoders, and LLMs, including error analysis and robustness verification. Additionally, we evaluate multilingual transfer using a subset of opinions in English, French, and German. The paper concludes with insights on deployment and practical applications.
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > Ontario > Toronto (0.05)
- (15 more...)
- Consumer Products & Services (0.68)
- Law (0.68)
- Government > Regional Government (0.47)
A Personal data Value at Risk Approach
What if the main data protection vulnerability is risk management? Data Protection merges three disciplines: data protection law, information security, and risk management. Nonetheless, very little research has been made on the field of data protection risk management, where subjectivity and superficiality are the dominant state of the art. Since the GDPR tells you what to do, but not how to do it, the solution for approaching GDPR compliance is still a gray zone, where the trend is using the rule of thumb. Considering that the most important goal of risk management is to reduce uncertainty in order to take informed decisions, risk management for the protection of the rights and freedoms of the data subjects cannot be disconnected from the impact materialization that data controllers and processors need to assess. This paper proposes a quantitative approach to data protection risk-based compliance from a data controllers perspective, with the aim of proposing a mindset change, where data protection impact assessments can be improved by using data protection analytics, quantitative risk analysis, and calibrating expert opinions.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.05)
- Europe > France > Hauts-de-France > Nord > Lille (0.05)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- (7 more...)
An Overview and Comparison of Axiomatization Structures Regarding Inconsistency Indices' Properties in Pairwise Comparisons Methods
Pant, Sangeeta, Kumar, Anuj, Mazurek, Jiří
Mathematical analysis of the analytic hierarchy process (AHP) led to the development of a mathematical function, usually called the inconsistency index, which has the center role in measuring the inconsistency of the judgements in AHP. Inconsistency index is a mathematical function which maps every pairwise comparison matrix (PCM) into a real number. An inconsistency index can be considered more trustworthy when it satisfies a set of suitable properties. Therefore, the research community has been trying to postulate a set of desirable rules (axioms, properties) for inconsistency indices. Subsequently, many axiomatic frameworks for these functions have been suggested independently, however, the literature on the topic is fragmented and missing a broader framework. Therefore, the objective of this article is twofold. Firstly, we provide a comprehensive review of the advancements in the axiomatization of inconsistency indices' properties during the last decade. Secondly, we provide a comparison and discussion of the aforementioned axiomatic structures along with directions of the future research.
- Asia > India > Maharashtra (0.04)
- North America > United States > New York (0.04)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- (2 more...)
- Overview (1.00)
- Research Report > New Finding (0.46)
From the evolution of public data ecosystems to the evolving horizons of the forward-looking intelligent public data ecosystem empowered by emerging technologies
Nikiforova, Anastasija, Lnenicka, Martin, Milić, Petar, Luterek, Mariusz, Bolívar, Manuel Pedro Rodríguez
Public data ecosystems (PDEs) represent complex socio-technical systems crucial for optimizing data use in the public sector and outside it. Recognizing their multifaceted nature, previous research pro-posed a six-generation Evolutionary Model of Public Data Ecosystems (EMPDE). Designed as a result of a systematic literature review on the topic spanning three decade, this model, while theoretically robust, necessitates empirical validation to enhance its practical applicability. This study addresses this gap by validating the theoretical model through a real-life examination in five European countries - Latvia, Serbia, Czech Republic, Spain, and Poland. This empirical validation provides insights into PDEs dynamics and variations of implementations across contexts, particularly focusing on the 6th generation of forward-looking PDE generation named "Intelligent Public Data Generation" that represents a paradigm shift driven by emerging technologies such as cloud computing, Artificial Intelligence, Natural Language Processing tools, Generative AI, and Large Language Models (LLM) with potential to contribute to both automation and augmentation of business processes within these ecosystems. By transcending their traditional status as a mere component, evolving into both an actor and a stakeholder simultaneously, these technologies catalyze innovation and progress, enhancing PDE management strategies to align with societal, regulatory, and technical imperatives in the digital era.
- Europe > Serbia (0.34)
- Europe > Latvia (0.25)
- North America > United States > Hawaii (0.04)
- (9 more...)
- Information Technology (1.00)
- Government > E-government (0.47)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)
- Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.37)
Consistent Collaborative Filtering via Tensor Decomposition
Zhao, Shiwen, Crissman, Charles, Sapiro, Guillermo R
Collaborative filtering is the de facto standard for analyzing users' activities and building recommendation systems for items. In this work we develop Sliced Anti-symmetric Decomposition (SAD), a new model for collaborative filtering based on implicit feedback. In contrast to traditional techniques where a latent representation of users (user vectors) and items (item vectors) are estimated, SAD introduces one additional latent vector to each item, using a novel three-way tensor view of user-item interactions. This new vector extends user-item preferences calculated by standard dot products to general inner products, producing interactions between items when evaluating their relative preferences. SAD reduces to state-of-the-art (SOTA) collaborative filtering models when the vector collapses to one, while in this paper we allow its value to be estimated from data. The proposed SAD model is simple, resulting in an efficient group stochastic gradient descent (SGD) algorithm. We demonstrate the efficiency of SAD in both simulated and real world datasets containing over 1M user-item interactions. By comparing SAD with seven alternative SOTA collaborative filtering models, we show that SAD is able to more consistently estimate personalized preferences.
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- Europe > Czechia > Pardubice Region > Pardubice (0.04)
- Media (0.48)
- Leisure & Entertainment (0.47)
- Information Technology (0.47)
- (2 more...)
Profitable Strategy Design by Using Deep Reinforcement Learning for Trades on Cryptocurrency Markets
Asgari, Mohsen, Khasteh, Seyed Hossein
Deep Reinforcement Learning solutions have been applied to different control problems with outperforming and promising results. In this research work we have applied Proximal Policy Optimization, Soft Actor-Critic and Generative Adversarial Imitation Learning to strategy design problem of three cryptocurrency markets. Our input data includes price data and technical indicators. We have implemented a Gym environment based on cryptocurrency markets to be used with the algorithms. Our test results on unseen data shows a great potential for this approach in helping investors with an expert system to exploit the market and gain profit. Our highest gain for an unseen 66 day span is 4850 US dollars per 10000 US dollars investment. We also discuss on how a specific hyperparameter in the environment design can be used to adjust risk in the generated strategies.
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > New Jersey (0.04)
- (3 more...)
The state-of-the-art in text-based automatic personality prediction
Feizi-Derakhshi, Ali-Reza, Feizi-Derakhshi, Mohammad-Reza, Ramezani, Majid, Nikzad-Khasmakhi, Narjes, Asgari-Chenaghlu, Meysam, Akan, Taymaz, Ranjbar-Khadivi, Mehrdad, Zafarni-Moattar, Elnaz, Jahanbakhsh-Naghadeh, Zoleikha
The above quotation becomes the basis of what is present in this article, studying natural language processing in individual personality. Personality is defined as the characteristic set of behaviours, cognitions, and emotional patterns [1] as well as thinking patterns [2], and its external appearance can be seen in writing, speech, decision and other aspects of the social and personal lives of people. Language is the most prominent and the most available aspects of individuals' personality. Meanwhile, written text is one of the most utilized appearance of language. Developing the Internet based infrastructure such as social media, e-mails, and different texting contexts, have made the language appearance of people more available. Consequently, considering the increasing of internet based communications, it would be so exciting to became aware of individuals' personality, inspite of their absence. Therefore, the involvement of computers in determining the personality of people seems necessary and turned into a study field in computer science. Automatic Personality Prediction (or Perception) (APP) is the automatic prediction of the personality of individuals and usually done by computers.
- North America > United States > New York > New York County > New York City (0.14)
- Asia > Middle East > Iran > East Azerbaijan Province > Tabriz (0.04)
- Europe > Czechia > Pardubice Region > Pardubice (0.04)
- (13 more...)